Non-mainstream Languages and Speech Recognition: Some Challenges
نویسندگان
چکیده
Most languages of the world have not been the focus of a speech recognition development effort, and the choice of technical approaches best suited to a language can be substantially impacted by the cultural context surrounding it. As the technologist and the teacher of or expert in a non-mainstream language and its culture are typically not the same person, issues that are self-evident to one may come as a surprise to the other. The goal of this paper is to add one plank to the bridge between these two areas of expertise by highlighting some aspects of non-mainstream linguistic contexts that pose challenges to the usual model of speech recognition system development and by suggesting alternative ways to meet these challenges.
منابع مشابه
Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods
For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...
متن کاملPersian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کاملUsing Task-Oriented Spoken Dialogue Systems for Language Learning: Potential, Practical Applications and Challenges
The technology developed for task-based spoken dialogue systems (SDS) has a significant potential for Computer-Assisted Language Learning. Based on the CMU Let’s Go SDS, we describe two areas in which we investigated adaptations of the technology to non-native speakers: speech recognition and correction prompt generation. Although difficulties remain, particularly towards robust understanding, ...
متن کاملEmotiphons: Emotion markers in Conversational Speech - Comparison across Indian Languages
In spontaneous speech, emotion information is embedded at several levels: acoustic, linguistic, gestural (non-verbal), etc. For emotion recognition in speech, there is much attention to acoustic level and some attention at the linguistic level. In this study, we identify paralinguistic markers for emotion in the language. We study two Indian languages belonging to two distinct language families...
متن کاملChallenges in Speech Processing of Slavic Languages (Case Studies in Speech Recognition of Czech and Slovak)
Slavic languages pose a big challenge for researchers dealing with speech technology. They exhibit a large degree of inflection, namely declension of nouns, pronouns and adjectives, and conjugation of verbs. This has a large impact on the size of lexical inventories in these languages, and significantly complicates the design of text-to-speech and, in particular, speech-to-text systems. In the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006